Gene Sequence of sdd17, an Omega-3 Fatty Acid Desaturase Gene
from Saprolegnia diclina (ATCC 56851)



1 ATGACTGAGG ATAAGACGAA GGTCGAGTTC CCGACGCTCA CGGAGCTCAA 

51 GCACTCGATC CCGAACGCGT GCTTTGAGTC GAACCTCGGC CTCTCGCTCT

101 ACTACACGGC CCGCGCGATC TTCAACGCGT CGGCCTCGGC GGCGCTGCTC

151 TACGCGGCGC GCTCGACGCC GTTCATTGCC GATAACGTTC TGCTCCACGC

201 GCTCGTTTGC GCCACCTACA TCTACGTGCA GGGCGTCATC TTCTGGGGCT

251 TCTTCACGGT CGGCCACGAC TGCGGCCACT CGGCCTTCTC GCGCTACCAC

301 AGCGTCAACT TTATCATCGG CTGCATCATG CACTCTGCGA TTTTGACGCC

351 GTTCGAGAGC TGGCGCGTGA CGCACCGCCA CCACCACAAG AACACGGGCA

401 ACATTGATAA GGACGAGATC TTTTACCCGC ACCGGTCGGT CAAGGACCTC

451 CAGGACGTGC GCCAATGGGT CTACACGCTC GGCGGTGCGT GGTTTGTCTA

501 CTTGAAGGTC GGGTATGCCC CGCGCACGAT GAGCCACTTT GACCCGTGGG

551 ACCCGCTCCT CCTTCGCCGC GCGTCGGCCG TCATCGTGTC GCTCGGCGTC

601 TGGGCCGCCT TCTTCGCCGC GTACGCGTAC CTCACATACT CGCTCGGCTT

651 TGCCGTCATG GGCCTCTACT ACTATGCGCC GCTCTTTGTC TTTGCTTCGT

701 TCCTCGTCAT TACGACCTTC TTGCACCACA ACGACGAAGC GACGCCGTGG

751 TACGGCGACT CGGAGTGGAC GTACGTCAAG GGCAACCTCT CGAGCGTCGA

801 CCGCTCGTAC GGCGCGTTCG TGGACAACCT GAGCCACCAC ATTGGCACGC

851 ACCAGGTCCA CCACTTGTTC CCGATCATTC CGCACTACAA GCTCAACGAA

901 GCCACCAAGC ACTTTGCGGC CGCGTACCCG CACCTCGTGC GCAGGAACGA

951 CGAGCCCATC ATCACGGCCT TCTTCAAGAC CGCGCACCTC TTTGTCAACT 

1001 ACGGCGCTGT GCCCGAGACG GCGCAGATCT TCACGCTCAA AGAGTCGGCC 

1051 GCGGCCGCCA AGGCCAAGTC GGACTAA 

FIG. 2 
Amino Acid Sequence of an Omega-3 Fatty Acid Desaturase (SDD17) from
Saprolegnia diclina (ATCC 56851)

1 MTEDKTKVEF PTLTELKHSI PNACFESNLG LSLYYTARAI FNASASAALL 
51 YAARSTPFIA DNVLLHALVC ATYIYVQGVI FWGFFTVGHD CGHSAFSRYH 
101 SVNFIIGCIM HSAILTPFES WRVTHRHHHK NTGNIDKDEI FYPHRSVKDL 
151 QDVRQWVYTL GGAWFVYLKV GYAPRTMSHF DPWDPLLLRR ASAVIVSLGV 
201 WAAFFAAYAY LTYSLGFAVM GLYYYAPLFV FASFLVITTF LHHNDEATPW 
251 YGDSEWTYVK GNLSSVDRSY GAFVDNLSHH IGTHQVHHLF PIIPHYKLNE 
301 ATKHFAAAYP HLVRRNDEPI ITAFFKTAHL FVNYGAVPET AQIFTLKESA 
351 AAAKAKSD* 

FIG. 3 
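
The nucleotide listing of FIG. 2 and the amino acid listing of FIG. 3 can be cross-checked by translating the open reading frame codon by codon. The Python sketch below is illustrative only and is not part of the original figures; the function name `translate` is hypothetical, and it assumes that the standard genetic code applies and that the reading frame starts at position 1.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) in frame 1; '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the sdd17 listing (row 1 of FIG. 2)
print(translate("ATGACTGAGG ATAAGACGAA GGTCGAGTTC"))
```

Applied to the first thirty bases of FIG. 2, the sketch gives the first ten residues of FIG. 3; the same check can be run row by row to locate transcription errors in either listing.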
Comparative analysis of S. diclina Delta 17-desaturase (SDD17.pep)
& Synechocystis sp. Delta 15-desaturase (SYCDESB)

Frame: 2 initn: 733 init1: 305 opt: 689 Z-score: 996.8 expect(): 1.5e-47
40.9% identity in 269 aa overlap
(76-336:204-471)



50 60 70 80 90 100
SDD17.pep SAALLYAARSTPFIADNVLLHALVCATYIYVQGVIFWGFFTVGHDCGHSAFSRYHSVNFI
SYCDESB YFFLDVGLIAGPfAUVAYLDSWFFYPII^LiQGTLFWSLFWGHDCGHGSFSKSKTLNNW
530 560 590 620 650 680

110 120 130 140 150 160
SDD17.pep IGCIMHSAILTPFESWRVTHRHHHKNTGNIDKDEIFYPHRSVKDLQ----DVRQWVYTLG
SYCDESB ighlshtpilvpyhgwrishrthUU^
710 740 770 800 830 860

170 180 190 200 210 220
SDD17.pep GAWFVYLKVGYAPRTMSHFDPWDPLLL-RRASAVIVSLGVWAAFFAAYAYLTYSLGFAVM
SYCDESB IAYPIYLFRRSPNRQGSHFMPGSPLFRPGEKAAVLTSTFA
890 920 950 980 1010 1040

230 240 250 260 270 280
SDD17.pep GLYYYAPLFVFASFLVITTFLHHNDEATPWYGDSEWTYVKGNLSSVDRSYGAFVDNLSHH
SYCDESB LKFYVAPYLVFVVWLDLVTFLHHTEDNIPWYRGDDWYFLKGALSTIDRDYG-FINPIHHD
1070 1100 1130 1160 1190 1220

290 300 310 320 330
SDD17.pep IGTHQVHHLFPIIPHYKLNEATKHFAAAYPHLVRRNDEPIITAFFKT---AHLFVNYGAV
SYCDESB igthvahhifsnmMyklrrateaikpilgeyyr^
1250 1280 1310 1340 1370 1400

340 350
SDD17.pep PETAQIFTLKESAAAAKAKSD
SYCDESB VYYQSPSNGGYQKKPXLILIESNQHREGRQYXMVLLPSDRLMRSMEEVKQSHSKRSALNQ
1430 1460 1490 1520 1550 1580



FIG. 4 
Comparative analysis of S. diclina Delta 17-desaturase (SDD17.pep)
& C. elegans Delta 17-desaturase (CELEFAT)

Frame: 1 initn: 490 init1: 222 opt: 502 Z-score: 724.0 expect(): 2.3e-32
31.6% identity in 310 aa overlap
(2-303:49-347)



10 20 30
SDD17.pep MTEDKTKVEFPTLTELKHSIPNACFESNLGL
CELFAT VTODVLVDARASLEEKEAPIW
80 110 140 170 200

40 50 60 70 80 90
SDD17.pep SLYYTARAIFNASASAALLYAARSTPFIADNVLLHALVCATYIYVQGVIFWGFFT
CELFAT IirIlVCIDF UtTILYF^ATCYFG^YLLlrt-.-WFGFAL^DC
230 260 290 320 350

100 110 120 130 140
SDD17.pep GHSAFSRYHSVNFIIGCIMHSAILTPFESWRVTHRHHHKNTGN
CELFAT lhgsfIdnqnlndfighiafsplfspyfpWshkl^
410 440 470 500
SDD17.pep DKD--EIFYPHRSVKD
CELFAT DKDHGHVWIQDKDWEA
530 560

150 160 170 180 190 200
SDD17.pep LQDVRQWV----YTLGGAWF-VYLKVGYAPRTMSHFDPWDPLLLRRASAVIVSLGVWAAF
CELFAT MPSWKRWFNPIPFSGWLKW^pWtLFG^
590 620 650 680 710 740

210 220 230 240 250 260
SDD17.pep FAAYAYL
CELFAT VCAYIAL
SDD17.pep YSLGFAVMGLYYYAPLFVFASFLV
CELFAT IAGSYSNWFWYYWVPLSFFGLMLV
770 800 830
SDD17.pep TTFLHHNDEATPWYGDSEWTYVKGNLS
CELFAT VTYLQHVDDVAEVYEADEWSFVRGQTQ
860 890 920

270 280 290 300 310 320
SDD17.pep SVDRSYGAFVDNLSHH
CELFAT TiDRYYGLGLDTTMl!lH
950
SDD17.pep -GTHQVHHLFPIIPHYKLNEATKHFAAAYPHLVRRNDEPIITA
CELFAT tdghvahhffnkIpiW
980 1010 1040 1070 1100

330 340 350
SDD17.pep FFKTAHLFVNYGAVPETAQIFTLKESAAAAKAKSD
CELFAT NYDFFARFLWFNYKLDYLVHKTAGIMQFRTTLEEKA^XKNIPCRSRVQQQLLRFHR^
1130 1160 1190

FIG. 5 
Gene Sequence of sdd12, a Delta 12-Desaturase Gene
from Saprolegnia diclina (ATCC 56851)

1 ATGTGCAAAG GTCAAGCTCC TTCCAAGGCC GACGTGTTCC ACGCTGCGGG 

51 GTACCGCCCG GTCGCCGGCA CGCCCGAGCC GCTGCCGCTG GAGCCCCCGA 

101 CGATCACGCT CAAGGACCTG CGCGCGGCGA TCCCGGCCCA CTGCTTTGAG 

151 CGCAGCGCTG CCACTAGCTT TTACCATTTG GCCAAGAACC TTGCGATCTG 

201 CGCCGGCGTG TTCGCCGTTG GCCTCAAGCT CGCGGCTGCC GACTTGCCGC 

251 TCGCGGCCAA GCTGGTCGCG TGGCCCATCT ACTGGTTCGT CCAGGGCACG 

301 TACTTTACGG GCATCTGGGT CATTGCGCAC GAATGCGGCC ACCAGGCGTT

351 CTCGGCGTCC GAGATCCTCA ACGACACGGT CGGTATCATT CTTCACTCGC 

401 TCCTCTTTGT GCCGTACCAC AGCTGGAAGA TCACGCACCG CCGCCACCAC

451 TCCAACACGG GCAGCTGCGA GAACGACGAG GTGTTTACGC CGACGCCGCG 

501 GTCCGTCGTC GAGGCCAAGC ACGACCACTC GCTCCTCGAA GAGAGCCCGC 

551 TCTACAACCT GTACGGCATC GTCATGATGC TTCTCGTGGG CTGGATGCCG 

601 GGCTACCTCT TCTTCAACGC GACCGGCCCG ACCAAGTACG CTGGCCTCGC 

651 CAAGTCGCAC TTCAACCCGT ACGCAGCCTT TTTCCTCCCA AAGGAGCGCC 

701 TCAGCATCTG GTGGAGCGAC CTCTGCTTCC TCGCGGCCTT GTACGGCTTT 

751 GGCTACGGCG TCTCGGTCTT CGGCCTCCTC GATGTCGCCC GCCACTACAT 

801 CGTGCCGTAC CTCATTTGCA ACGCGTACCT CGTGCTCATC ACGTACCTCC 

851 AGCACACGGA TACGTACGTG CCCCACTTCC GCGGCGACGA GTGGAACTGG 

901 CTGCGCGGCG CGCTCTGCAC CGTCGACCGC TCGTTCGGCG CGTGGATCGA 

951 CAGCGCGATC CACCACATTG CCGACACGCA CGTGACGCAC CACATTTTCT 

1001 CCAAGACGCC CTTCTACCAC GCGATCGAGG CGACCGACGC CATCACGCCC

1051 CTCCTCGGCA AGTACTACCT CATCGACCCG ACGCCGATCC CGCTGGCGCT

1101 CTGGCGCTCG TTCACGCACT GCAAGTACGT CGAGGACGAC GGCAACGTTG 

1151 TGTTTTACAA GCGCAAGCTC GAGGAAAAGT AA 

FIG. 6 
Amino Acid Sequence of a Delta 12-Desaturase (SDD12) from Saprolegnia
diclina (ATCC 56851)

1 MCKGQAPSKA DVFHAAGYRP VAGTPEPLPL EPPTITLKDL RAAIPAHCFE 

51 RSAATSFYHL AKNLAICAGV FAVGLKLAAA DLPLAAKLVA WPIYWFVQGT 

101 YFTGIWVIAH ECGHQAFSAS EILNDTVGII LHSLLFVPYH SWKITHRRHH 

151 SNTGSCENDE VFTPTPRSVV EAKHDHSLLE ESPLYNLYGI VMMLLVGWMP 

201 GYLFFNATGP TKYAGLAKSH FNPYAAFFLP KERLSIWWSD LCFLAALYGF 

251 GYGVSVFGLL DVARHYIVPY LICNAYLVLI TYLQHTDTYV PHFRGDEWNW 

301 LRGALCTVDR SFGAWIDSAI HHIADTHVTH HIFSKTPFYH AIEATDAITP 

351 LLGKYYLIDP TPIPLALWRS FTHCKYVEDD GNVVFYKRKL EEK* 

FIG.7 
Comparative analysis of S. diclina Delta 12-desaturase (SDD12.pep)
& G. hirsutum Delta 12-desaturase (GH06DES)

Frame: 3 initn: 992 init1: 413 opt: 1086 Z-score: 1531.8 expect(): 2.3e-77
45.6% identity in 379 aa overlap
(9-380:14-384)

10 20 30 40
SDD12.pep MCKGQAPSKADVFHAAGYRPVAGTPEPLPLEPPTITLKDLRAAIPAHC
GH06DESAT lrvsstwrxtaffiU^kAgaggrmp^
10 40 70 100 130 160

50 60 70 80 90 100
SDD12.pep FERSAATSFYHLAKNLAICAGVFAVGLKLAAADLPLAAKLVAWPI
GH06DESAT frAsllrsfswi^
190 220 250 280 310

110 120 130 140 150 160
SDD12.pep AHECGHQAFSASEILNDTVGI
GH06DESAT IhewghhIfrdyqwvd
370 400
SDD12.pep LHSLLFVPYHSWK
GH06DESAT LHSALLVPYFSWK
430 460
SDD12.pep THRRHHSNTGSCENDEVFTPTPRS
GH06DESAT KPKS
490 520

170 180 190 200 210 220
SDD12.pep VVEAKH
GH06DESAT KLSC- • -FAKYLNNPPGRVLSLivTtTLGi- WIafAvsA- -RYYDR^lUpYGPIY
550 580 610 640 670

230 240 250 260 270 280
SDD12.pep LPKERLSIWWSDLCFLAALYGFGYGVSVFGLLDVARHYIVPYL
GH06DESAT SDRERiQVYISDTGiFAViYVLYKIAAkGLAWLLCTYGVPLL
700 730 760 790 820
SDD12.pep CNAYLVLITYLQHTDT
850

290 300 310 320 330 340
SDD12.pep YVPHFRGDEWNWLRGALCTVDRSFGAWIDSAIHHIADTHVTHHI
GH06DESAT ALpiYDSSEWDiLRGALST^DRDFGV-LNKVFHNITDTHVAHHLFSTMPHYHAMEATKA
880 910 940 970 1000 1030

350 360 370 380 390
SDD12.pep TPLLGKYYLIDPTPIPLALWRSFTHCKYVEDDGNVVFYKRKLEEK
GH06DESAT KF PILoWipFDGlplYlJ^WREAKECLWEPuVG^
1060 1090 1120 1150 1180 1210



FIG. 8 
Sequence ID 1

5'-ATC CGC GCC GCC ATC CCC AAG CAC TGC TGG GTC AAG-3'

Sequence ID 2

5'-GCC CTC TTC GTC CTC GGC CAY GAC TGC GGC CAY GGC TCG TTC TCG-3'

Sequence ID 3

5'-GAG RTG GTA RTG GGG GAT CTG GGG GAA GAR RTG RTG GRY GAC RTG-3'

Sequence ID 4

5'-^TA^ CAY GGC TGG CGC ATC TCG CAY CGC ACC CAY CAY CAG AAC-3'

Sequence ID 5

5'-^CT^ RTG RTG GGT CCG RTG CGA GAT GCG CCA GCC RTG GTA GGG-3'

Sequence ID 6

5'-GGC TCG CAC TTC SAC CCC KAC TCG GAC CTC TTC GTC-3'

Sequence ID 7

5'-GAC GAA GAG GTC CGA GTM GGG GTW GAA GTG CGA GCC-3'

Sequence ID 8

5'-GCG CTG GAK GGT GGT GAG GCC GCC GCG GAW GSA CGA CCA-3'

Sequence ID 9

5'-C?G GG^ GAA GAG RTG RTG GAT GAC RTG GGT GCC GAT GTC RTG RTG-3'

Sequence ID 10

5'-GCT GG? CTC GAY GAG RTG GTA RTG GGG GAT CTK GGG GAA GAR RTG-3'

Sequence ID 11

5'-GAG RTG GTA RTG GGG GAT CTG GGG GAA GAR RTG RTG GRY GAC RTG-3'

Sequence ID 12

5'-TAC GCG TAC CTC ACQ TAC TCG CTC G-3'

Sequence ID 13

5'-TTC TTG CAC CAC AAC GAC GAA GCG ACG-3'

Sequence ID 14

5'-GGA GTG GAC GTA CGT CAA GGG CAA C-3'

Sequence ID 15

5'-TCA AGG GCA ACC TCT CGA GCG TCG AC-3'

Sequence ID 16

5'-CCC AGT CAC GAC GTT GTA AAA CGA CGG CCA G-3'

FIG.9A 
Sequence ID 17

5'-AGC GGA TAA CAA TTT CAC ACA GGA AAC AGC-3'

Sequence ID 18

5'-GGT AAA AGA TCT CGT CCT TGT CGA TGT TGC-3'

Sequence ID 19

5'-GTC AAA GTG GCT CAT CGT GC-3'

Sequence ID 20

5'-CGA GCG AGT ACG TGA GGT ACG CGT AC-3'

Sequence ID 21

5'-TCA ACA GAA TTC ATG ACC GAG GAT AAG ACG AAG GTC GAG TTC CCG-3'

Sequence ID 22

5'-AAA AGA AAG CTT CGC TTC CTA GTC TTA GTC CGA CTT GGC CTT GGC-3'

Sequence ID 23

5'-TCA ACA AAG CTT ATG ACC GAG GAT AAG ACG AAG GTC GAG TTC CCG-3'

Sequence ID 24

5'-AAA AGA GAA TTC CGC TTC CTA GTC TTA GTC CGA CTT GGC CTT GGC-3'

Sequence ID 25

1 ATGACTGAGG ATAAGACGAA GGTCGAGTTC CCGACGCTCA CGGAGCTCAA
51 GCACTCGATC CCGAACGCGT GCTTTGAGTC GAACCTCGGC CTCTCGCTCT
101 ACTACACGGC CCGCGCGATC TTCAACGCGT CGGCCTCGGC GGCGCTGCTC
151 TACGCGGCGC GCTCGACGCC GTTCATTGCC GATAACGTTC TGCTCCACGC
201 GCTCGTTTGC GCCACCTACA TCTACGTGCA GGGCGTCATC TTCTGGGGCT
251 TCTTCACGGT CGGCCACGAC TGCGGCCACT CGGCCTTCTC GCGCTACCAC
301 AGCGTCAACT TTATCATCGG CTGCATCATG CACTCTGCGA TTTTGACGCC
351 GTTCGAGAGC TGGCGCGTGA CGCACCGCCA CCACCACAAG AACACGGGCA
401 ACATTGATAA GGACGAGATC TTTTACCCGC ACCGGTCGGT CAAGGACCTC
451 CAGGACGTGC GCCAATGGGT CTACACGCTC GGCGGTGCGT GGTTTGTCTA
501 CTTGAAGGTC GGGTATGCCC CGCGCACGAT GAGCCACTTT GACCCGTGGG
551 ACCCGCTCCT CCTTCGCCGC GCGTCGGCCG TCATCGTGTC GCTCGGCGTC
601 TGGGCCGCCT TCTTCGCCGC GTACGCGTAC CTCACATACT CGCTCGGCTT
651 TGCCGTCATG GGCCTCTACT ACTATGCGCC GCTCTTTGTC TTTGCTTCGT
701 TCCTCGTCAT TACGACCTTC TTGCACCACA ACGACGAAGC GACGCCGTGG
751 TACGGCGACT CGGAGTGGAC GTACGTCAAG GGCAACCTCT CGAGCGTCGA
801 CCGCTCGTAC GGCGCGTTCG TGGACAACCT GAGCCACCAC ATTGGCACGC
851 ACCAGGTCCA CCACTTGTTC CCGATCATTC CGCACTACAA GCTCAACGAA
901 GCCACCAAGC ACTTTGCGGC CGCGTACCCG CACCTCGTGC GCAGGAACGA
951 CGAGCCCATC ATCACGGCCT TCTTCAAGAC CGCGCACCTC TTTGTCAACT
1001 ACGGCGCTGT GCCCGAGACG GCGCAGATCT TCACGCTCAA AGAGTCGGCC
1051 GCGGCCGCCA AGGCCAAGTC GGACTAA



FIG.9B 
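
Several of the oligonucleotides in FIG. 9A and FIG. 9B appear to be antisense primers, meaning that their reverse complement, rather than the primer itself, is found in the coding strand. The Python sketch below is illustrative only and is not part of the original figures. The function name `reverse_complement` is hypothetical, and the primer string assumes that the two printed fragments of Sequence ID 19 ("5'-GTC AAA GTG GCT" and "CAT CGT GC-3'") belong to one oligonucleotide. Degenerate bases are complemented according to the IUPAC ambiguity codes.

```python
# Complement table covering A, C, G, T and the IUPAC ambiguity codes
# used in the primer listings (R, Y, K, M, S, W, N).
COMPLEMENT = str.maketrans("ACGTRYKMSWN", "TGCAYRMKSWN")


def reverse_complement(primer: str) -> str:
    """Return the reverse complement, ignoring 5'/3' labels, hyphens and spaces."""
    seq = "".join(ch for ch in primer.upper() if ch in "ACGTRYKMSWN")
    return seq.translate(COMPLEMENT)[::-1]


# Sequence ID 19, joined from its two printed fragments (an assumption)
primer_19 = "5'-GTC AAA GTG GCT CAT CGT GC-3'"

# Row 501 of the sdd17 coding sequence (FIG. 2 / Sequence ID 25)
row_501 = "CTTGAAGGTC GGGTATGCCC CGCGCACGAT GAGCCACTTT GACCCGTGGG".replace(" ", "")

print(reverse_complement(primer_19) in row_501)
```

Under that assumption the reverse complement of Sequence ID 19 lies within row 501 of the sdd17 sequence, which is consistent with its use as a gene-specific antisense primer.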
Sequence ID 26

1 MTEDKTKVEF PTLTELKHSI PNACFESNLG LSLYYTARAI FNASASAALL
51 YAARSTPFIA DNVLLHALVC ATYIYVQGVI FWGFFTVGHD CGHSAFSRYH
101 SVNFIIGCIM HSAILTPFES WRVTHRHHHK NTGNIDKDEI FYPHRSVKDL
151 QDVRQWVYTL GGAWFVYLKV GYAPRTMSHF DPWDPLLLRR ASAVIVSLGV
201 WAAFFAAYAY LTYSLGFAVM GLYYYAPLFV FASFLVITTF LHHNDEATPW
251 YGDSEWTYVK GNLSSVDRSY GAFVDNLSHH IGTHQVHHLF PIIPHYKLNE
301 ATKHFAAAYP HLVRRNDEPI ITAFFKTAHL FVNYGAVPET AQIFTLKESA
351 AAAKAKSD*

Sequence ID 27

1 ATGGCCCCGC AGACGGAGCT CCGCCAGCGC CACGCCGGCG TCGCCGAGAC
51 GCCGGTGGCC GGCAAGAAGG CCTTTACATG GCAGGAGGTC GCGCAGCACA
101 ACACGGCGGC CTCGGCCTGG ATCATTATCC GCGGCAAGGT CTACGACGTG
151 ACCGAGTGGG CCAACAAGCA CCCCGGCGGC CGCGAGATGG TGCTGCTGCA
201 CGCCGGTCGC GAGGCCACCG ACACGTTCGA CTCGTACCAC CCGTTCAGCG
251 ACAAGGCCGA GTCGATCTTG AACAAGTATG AGATTGGCAC GTTCACGGGC
301 CCGTCCGAGT TTCCGACCTT CAAGCCGGAC ACGGGCTTCT ACAAGGAGTG
351 CCGCAAGCGC GTTGGCGAGT ACTTCAAGAA GAACAACCTC CATCCGCAGG
401 ACGGCTTCCC GGGCCTCTGG CGCATGATGG TCGTGTTTGC GGTCGCCGGC
451 CTCGCCTTGT ACGGCATGCA CTTTTCGACT ATCTTTGCGC TGCAGCTCGC
501 GGCCGCGGCG CTCTTTGGCG TCTGCCAGGC GCTGCCGCTG CTCCACGTCA
551 TGCACGACTC GTCGCACGCG TCGTACACCA ACATGCCGTT CTTCCATTAC
601 GTCGTCGGCC GCTTTGCCAT GGACTGGTTT GCCGGCGGCT CGATGGTGTC
651 ATGGCTCAAC CAGCACGTCG TGGGCCACCA CATCTACACG AACGTCGCGG
701 GCTCGGACCC GGATCTTCCG GTCAACATGG ACGGCGACAT CCGCCGCATC
751 GTGAACCGCC AGGTGTTCCA GCCCATGTAC GCATTCCAGC ACATCTACCT
801 TCCGCCGCTC TATGGCGTGC TTGGCCTCAA GTTCCGCATC CAGGACTTCA
851 CCGACACGTT CGGCTCGCAC ACGAACGGCC CGATCCGCGT CAACCCGCAC
901 GCGCTCTCGA CGTGGATGGC CATGATCAGC TCCAAGTCGT TCTGGGCCTT
951 CTACCGCGTG TACCTTCCGC TTGCCGTGCT CCAGATGCCC ATCAAGACGT
1001 ACCTTGCGAT CTTCTTCCTC GCCGAGTTTG TCACGGGCTG GTACCTCGCG
1051 TTCAACTTCC AAGTAAGCCA TGTCTCGACC GAGTGCGGCT ACCCATGCGG
1101 CGACGAGGCC AAGATGGCGC TCCAGGACGA GTGGGCAGTC TCGCAGGTCA
1151 AGACGTCGGT CGACTACGCC CATGGCTCGT GGATGACGAC GTTCCTTGCC
1201 GGCGCGCTCA ACTACCAGGT CGTGCACCAC TTGTTCCCCA GCGTGTCGCA
1251 GTACCACTAC CCGGCGATCG CGCCCATCAT CGTCGACGTC TGCAAGGAGT
1301 ACAACATCAA GTACGCCATC TTGCCGGACT TTACGGCGGC GTTCGTTGCC
1351 CACTTGAAGC ACCTCCGCAA CATGGGCCAG CAGGGCATCG CCGCCACGAT
1401 CCACATGGGC TAA



FIG.9C 
Sequence ID 28

1 ATGGCAAACA GCAGCGTGTG GGATGATGTG GTGGGCCGCG TGGAGACCGG
51 CGTGGACCAG TGGATGGATG GCGCCAAGCC GTACGCACTC ACCGATGGGC
101 TCCCGATGAT GGACGTGTCC ACCATGCTGG CATTCGAGGT GGGATACATG
151 GCCATGCTGC TCTTCGGCAT CCCGATCATG AAGCAGATGG AGAAGCCTTT
201 TGAGCTCAAG ACCATCAAGC TCTTGCACAA CTTGTTTCTC TTCGGACTTT
251 CCTTGTACAT GTGCGTGGAG ACCATCCGCC AGGCTATCCT CGGAGGCTAC
301 AAAGTGTTTG GAAACGACAT GGAGAAGGGC AACGAGTCTC ATGCTCAGGG
351 CATGTCTCGC ATCGTGTACG TGTTCTGCGT GTCCAAGGCA TACGAGTTCT
401 TGGATACCGC CATCATGATC CTTTGCAAGA AGTTCAACCA GGTTTCCTTC
451 TTGCATGTGT ACCACCATGC CACCATTTTT GCCATCTGGT GGGCTATCGC
501 CAAGTACGCT CCAGGAGGTG ATGCGTACTT TTCAGTGATC CTCAACTCTT
551 TCGTGCACAC CGTCATGTAC GCATACTACT TCTTCTCCTC CCAAGGGTTC
601 GGGTTCGTGA AGCCAATCAA GCCGTACATC ACCACCCTTC AGATGACCCA
651 GTTCATGGCA ATGCTTGTGC AGTCCTTGTA CGACTACCTC TTCCCATGCG
701 ACTACCCACA GGCTCTTGTG CAGCTTCTTG GAGTGTACAT GATCACCTTG
751 CTTGCCCTCT TCGGCAACTT TTTTGTGCAG AGCTATCTTA AAAAGCCAAA
801 AAAGAGCAAG ACCAACTAA

Sequence ID 29

1 MTVGFDETVT MDTVRNHNMP DDAWCAIHGT VYDITKFSKV HPGGDIIMLA
51 AGKEATILFE TYHIKGVPDA VLRKYKVGKL PQGKKGETSH MPTGLDSASY
101 YSWDSEFYRV LRERVAKKLA EPGLMQRARM ELWAKAIFLL AGFWGSLYAM
151 CVLDPHGGAM VAAVTLGVFA AFVGTCIQHD GSHGAFSKSR FMNKAAGWTL
201 DMIGASAMTW EMQHVLGHHP YTNLIEMENG LAKVKGADVD PKKVDQESDP
251 DVFSTYPMLR LHPWHRQRFY HKFQHLYAPL IFGFMTINKV ISQDVGVVLR
301 KRLFQIDANC RYGSPWNVAR FWIMKLLTTL YMVALPMYMQ GPAQGLKLFF
351 MAHFTCGEVL ATMFIVNHII EGVSYASKDA VKGVMAPPRT VHGVTPMQVT
401 QKALSAAEST KSDADKTTMI PLNDWAAVQC QTSVNWAVGS WFWNHFSGGL
451 NHQIEHHCFP QNPHTVNVYI SGIVKETCEE YGVPYQAEIS LFSAYFKMLS
501 HLRTLGNEDL TAWST*

Sequence ID 30

5'-CCG SAG TTC ACS ATC AAG GAG ATC CGC GAS KSC ATC CCG GCC CAC TGC TTC

Sequence ID 31

5'-GRS CTT CTT GAK GTG GWM SGT GGC CTC CTC GGC GTG GTA GWR CGG CAT-3'

Sequence ID 32

5'-CCS STC TAC TGG GCC TGC CAG GGT RTC GTC CTC ACS GGT GTC TGG-3'



FIG.9D 
Sequence ID 33

5'-CCS STC TAC TGG ATC RYS CAG GGT RTC GTC KGY ACS GGT GTC TGG-3'

Sequence ID 34

5'-GGC GTG GTA GTG CGG CAT SMM CGA GAA GAR GTG GTG GGC GAC GTG-3'

Sequence ID 35

5'-CAC GTA CCT CCA GCA CAC GGA CAC CTA CG-3'

Sequence ID 36

5'-GAT CGA CAG CGC GAT CCA CCA CAT TGC-3'

Sequence ID 37

5'-CAA ATG GTA AAA GCT AGT GGC AGC GCT GC-3'

Sequence ID 38

5'-AGT ACG TGC CCT GGA CGA ACC AGT AGA TG-3'

Sequence ID 39

5'-TCA ACA GAA TTC ATG TGC AAA GGT CAA GCT CCT TCC AAG GCC GAC GTG-3'

Sequence ID 40

5'-AAA AGA AAG CTT TTA CTT TTC CTC GAG CTT GCG CTT GTA AAA CAC AAC-3'

Sequence ID 41

1 ATGTGCAAAG GTCAAGCTCC TTCCAAGGCC GACGTGTTCC ACGCTGCGGG
51 GTACCGCCCG GTCGCCGGCA CGCCCGAGCC GCTGCCGCTG GAGCCCCCGA
101 CGATCACGCT CAAGGACCTG CGCGCGGCGA TCCCGGCCCA CTGCTTTGAG
151 CGCAGCGCTG CCACTAGCTT TTACCATTTG GCCAAGAACC TTGCGATCTG
201 CGCCGGCGTG TTCGCCGTTG GCCTCAAGCT CGCGGCTGCC GACTTGCCGC
251 TCGCGGCCAA GCTGGTCGCG TGGCCCATCT ACTGGTTCGT CCAGGGCACG
301 TACTTTACGG GCATCTGGGT CATTGCGCAC GAATGCGGCC ACCAGGCGTT
351 CTCGGCGTCC GAGATCCTCA ACGACACGGT CGGTATCATT CTTCACTCGC
401 TCCTCTTTGT GCCGTACCAC AGCTGGAAGA TCACGCACCG CCGCCACCAC
451 TCCAACACGG GCAGCTGCGA GAACGACGAG GTGTTTACGC CGACGCCGCG
501 GTCCGTCGTC GAGGCCAAGC ACGACCACTC GCTCCTCGAA GAGAGCCCGC
551 TCTACAACCT GTACGGCATC GTCATGATGC TTCTCGTGGG CTGGATGCCG
601 GGCTACCTCT TCTTCAACGC GACCGGCCCG ACCAAGTACG CTGGCCTCGC
651 CAAGTCGCAC TTCAACCCGT ACGCAGCCTT TTTCCTCCCA AAGGAGCGCC
701 TCAGCATCTG GTGGAGCGAC CTCTGCTTCC TCGCGGCCTT GTACGGCTTT
751 GGCTACGGCG TCTCGGTCTT CGGCCTCCTC GATGTCGCCC GCCACTACAT
801 CGTGCCGTAC CTCATTTGCA ACGCGTACCT CGTGCTCATC ACGTACCTCC
851 AGCACACGGA TACGTACGTG CCCCACTTCC GCGGCGACGA GTGGAACTGG
901 CTGCGCGGCG CGCTCTGCAC CGTCGACCGC TCGTTCGGCG CGTGGATCGA
951 CAGCGCGATC CACCACATTG CCGACACGCA CGTGACGCAC CACATTTTCT
1001 CCAAGACGCC CTTCTACCAC GCGATCGAGG CGACCGACGC CATCACGCCC
1051 CTCCTCGGCA AGTACTACCT CATCGACCCG ACGCCGATCC CGCTGGCGCT
1101 CTGGCGCTCG TTCACGCACT GCAAGTACGT CGAGGACGAC GGCAACGTTG
1151 TGTTTTACAA GCGCAAGCTC GAGGAAAAGT AA

FIG.9E
Sequence ID 42

1 MCKGQAPSKA DVFHAAGYRP VAGTPEPLPL EPPTITLKDL RAAIPAHCFE 

51 RSAATSFYHL AKNLAICAGV FAVGLKLAAA DLPLAAKLVA WPIYWFVQGT 

101 YFTGIWVIAH ECGHQAFSAS EILNDTVGII LHSLLFVPYH SWKITHRRHH 

151 SNTGSCENDE VFTPTPRSVV EAKHDHSLLE ESPLYNLYGI VMMLLVGWMP 

201 GYLFFNATGP TKYAGLAKSH FNPYAAFFLP KERLSIWWSD LCFLAALYGF 

251 GYGVSVFGLL DVARHYIVPY LICNAYLVLI TYLQHTDTYV PHFRGDEWNW 

301 LRGALCTVDR SFGAWIDSAI HHIADTHVTH HIFSKTPFYH AIEATDAITP 

351 LLGKYYLIDP TPIPLALWRS FTHCKYVEDD GNVVFYKRKL EEK* 



FIG.9F 



